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Abstract 

In this paper we propose a novel partition-based distributed state estimation scheme for non-overlapping sub¬ 
systems based on Kalman filter. The estimation scheme is designed in order to account, in a rigorous fashion, for 
dynamic coupling terms between subsystems, and for the uncertainty related to the state estimates performed by the 
neighboring subsystems. The online implementation of the proposed estimation scheme is scalable, since it involves 
(i) small-scale matrix operations to be carried out by the estimator embedded in each subsystem and (ii) neighbor- 
to-neighbor transmission of a limited amount of data. We provide theoretical conditions ensuring the estimation 
convergence. Reconfigurability of the proposed estimation scheme is allowed in case of plug and play operations. 
Simulation tests are provided to illustrate the effectiveness of the proposed algorithm. 


I. Introduction 

In many different engineering areas there has been, in the last years, a huge effort to develop algorithms 
and protocols allowing a number of interconnected, possibly spatially distributed systems, devices, sensors, and 
actuators, to operate cooperatively and to possess self-organization capabilities. Notable examples include smart 
grids ||24|, environmental monitoring systems 1^ . large-scale irrigation and hydraulic networks a, m, and 
multi-robot/vehicle systems m, ca. 

Related research on systems of systems IMl or cyber-physical systems a is nowadays fostered, pursuing several 
challenges, including the design of hierarchical and distributed monitoring and control systems with reliability and 
robustness properties with respect to uncertainties, changing environment, communication failures, etc. 

In particular, theoretically sound distributed monitoring and state estimation methods are necessary to allow for 
optimal managing of sensor networks. As also discussed in the survey paper ll^ . two main classes of estimation 
techniques for distributed smart sensing schemes are presently under investigation. They are generally both referred, 
in the literature, to as distributed state-estimation algorithms. While a widely-considered problem concerns the case 
where the full state of the system is estimated by all subsystems, e.g., based on consensus and diffusion strategies, 
e.g., ED, El, ca, in this paper we focus on partition-based estimation. The latter consists of estimating, for 
each sensor, only a part of the state vector of a system, using information transmitted by other neighboring sensors. 
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This problem gives rise to low-order estimation problems solved in a distributed way, and is particularly useful 
when the observed systems are large scale ones, e.g., power networks ll^ . Il23l . transport networks EH, process 
plants ll3^ . and robot fleets 121]. 

Concerning linear discrete-time systems, recent contributions include M, El, 03, Ca, 03, ED, M, ED, 

ES- 

Among these, EH, ini, El, ED propose Kalman filter-based estimation schemes suitable for systems affected by 
stochastic noise. The papers ESl . iMl propose methods based on local Kalman prediction equations (and neglect 
the dynamic interconnection terms) and on consensus steps to account for possible overlapping states between 
pairs of subsystems. The paper iflTlI proposes a two-step Kalman Alter, where the correction step is performed by 
each subsystem based on local measurements, while the prediction step is based on approximating the centralized 
error process using a distributed iterate-collapse inversion algorithm for L-banded matrices lITbl . As in 138] . ES, 
a consensus step is used to optimally account for overlapping states. Finally, in E9l . a prediction/corrector-based 
method for multi-rate systems is proposed. It is worth noting that sufficient convergence conditions are provided 
just in E4l which, in case of non-overlapping subsystems, basically amount to the stability of the original system. 
The papers lfT3 . lfT3l . ESI . ESI assume that the system is affected by bounded noise and guarantee, under suitable 
conditions, convergence of the estimator and the fulfillment of constraints on local states, e.g., in lfT2l . or estimation 
errors, e.g., in 113, EH), E3- Finally, m proposes an approximated distributed Alter based on the moving horizon 
estimator studied in Cl. A different - cooperative and iterative - approach based on Lagrange decomposition is 
proposed in C31, where continuous-time systems are considered. 

The conditions required for convergence of the estimators discussed in all the mentioned papers, where available, 
(with the notable exception of E3) require a centralized synthesis/analysis phase which (/) limits the application 
to very large-scale systems and (ii) requires a complete re-design in case of configuration changes (e.g., addi¬ 
tion/removal of subsystems or sensors). On the other hand, in ESi the design phase (guaranteeing global properties) 
is distributed, i.e., the state estimator embedded in each subsystem is devoted to solve a local design problem. This 
has paved the way to a plug-and-play (PnP) implementation E3, ESI, which confers flexibility, reconfigurability, 
and reliability to the estimation architecture. 

In this paper we propose a novel partition-based distributed state estimation scheme based on Kalman Alter (denoted 
DKF) for non-overlapping subsystems affected by stochastic noise. The estimation scheme is designed to account 
for dynamic coupling terms between subsystems, and for the uncertainty related to the state estimates performed by 
the neighboring subsystems. This is done in a conservative but rigorous way by means of suitable covariance matrix 
bounds. The online implementation of the proposed estimation scheme is scalable, since it involves (i) small-scale 
matrix operations to be carried out by the estimator embedded in each subsystem and (ii) neighbor-to-neighbor 
transmission of a limited amount of data. Concerning the design/analysis phase, we provide both centralized (both 
with suitable linear matrix inequalities and with aggregate small gain-type arguments) and distributed scalable 
conditions to be verified ensuring the estimation convergence. The latter are then used to provide a fully distributed 
and PnP implementation of DKF. More specifically, distributed reconfigurability conditions are provided in case a 
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subsystem is added to or removed from the network, and also in case PnP operations involve sensors. 

The paper is structured as follows. In Section|II]we introduce and motivate the distributed Kalman filter equations, 
while in Section Hill we provide the main conditions for convergence. In Section HVl we discuss how DKF can be 
designed in a distributed fashion and the resulting application for PnP operations. Finally, in Section|V]the algorithm 
is tested both on an academic example and on a benchmark case study, and in Section |VT] some conclusions are 
drawn. All proofs are postponed to Appendix A for better readability. 

Notation 

The symbols > and > are used to denote semi-definite positive matrices and definite positive matrices, respectively. 
The symbols and ££ are used for brevity to denote the Riccati equation update and the optimal Kalman predictor 
gain, respectively, i.e., 

^{P,A,C,Q,R) = APA^-APC'^iCPC'^ +R)-^CPA^ PQ 

^{P,A,C,R) = APC^(CPC^+R)-' 

where P, A, C, Q, and R are matrices of appropriate dimensions. Finally, the cardinality of a set is denoted 
with \^\ and the spectral radius of matrix A is denoted (J(A). 


II. The distributed Kalman filter 


A. Statement of the problem 


Consider M interconnected systems, each described by the following equations: 

Xi{k+\) =AiiXi{k) + Y.j^iAijXj{k)^Wi{k) 

yfk) =CiXi{k) + Vi{k) 

where Xi{k),Wi(k) G K"' and yi(k),Vi{k) G M^’'. We assume that w,(k) and v,(k) are zero-mean white noises, for all 
i = 1,... ,M, and that E{w!(k)wJ (k)} = Qi5ij, E{v,(A:)vj(k)} = Ri5ij (with P; > 0 for all ; = ,M), and that 

K{wi{k)v^ (h)} = 0 for all i,j = 1,... ,M, and h ,k>0. For i G {1,. ■. ,M}, we define with the set of neighbors 
(also denoted predecessors in 1261) of subsystem i defined as = {j\Aij 7 ^ 0 } while is the set of successors 
of subsystem i defined as = {y | i G cAy}. In our setup we assume that subsystem i can exchange information 
with its neighbors. Note that i is in general included in and .A^. 

Collectively, if we define the variables x{k) = {xi{k),... ,XM{k)), y{k) = (yi(k),... ,yM{k)), w{k) = {wi{k),.. .,WM{k)), 
and \{k) = (vi (k),..., we can rewrite ([T]l as 


x(k-l-l) =Ax{k)+w{k) 
y{k) =Cx{k)+\{k) 

where C =diag(Ci,... ,Cm), Q =diag(2i,... ,0^), R =diag(Pi,... ,Pm), and 


( 2 ) 


All . 

A\m 

Am\ 

■ Amm 
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The optimal centralized Kalman predictor ||6| for system (|2|i is 

Xc.(^+1) =AXf(^)+Lc(^)(y(^)-Cxc(^)) (3) 

where Xc{k) denotes the one-step optimal predictor of x{k). According to the classical Kalman prediction theory, 
the optimal gain is 

L,(^)=,5^(n,(^),A,C,R) (4) 

where fir (A:) is the centralized Kalman prediction error covariance matrix and is computed iteratively using the 
Riccati equation 

n,(^+i) = ^(n,(^),A,c,Q,R) 

= (A - L{k)C)ncik) (A - L{k)cy + Q + L(A:)RL(A:)^ 


B. Distributed prediction scheme 

As clear from OJl-®, the optimal centralized Kalman predictor for system (|2]| is based on the iteration of the 
Riccati equation (|5]l, which requires a global knowledge of the system and, in general, leads to a matrix gain which 
has not the sparsity properties of the dynamic system (i.e., of matrix A). 

In contrast, in this paper we seek for a distributed observer implementation, meaning that; (0 at most data originated 
by neighbors are used by the local observers, to reduce the communication load of the scheme; (//) information 
about the model of the overall system is not required to be stored by each local observer, but at most information 
concerning the neighboring subsystems; {Hi) the computational load required by each local filter is scalable. 

In line with this we propose an estimation scheme of the type 


where 


Xi{k+\)= ^ {AijXj (k) + Lij {k) {yj {k) - Cjxj {k ))} 


Lij{k) =AijPj{k)Cj{CjPj{k)Cj +Rj)-\ 


(6) 

(7) 


The matrices P, (k), i= 1,... ,M are updated according to the following distributed equation. 


Lje^(AijPjik)Ajj-A,jPjik)Cj 

^ \ (o) 

iCjPj{k)Cj +Rj)-^CjPjik)Ajj)+Qt 

where, for all ij= 1,... ,M, we have defined A,j = ^/^Aij, Ci = ^/gQ, and R,- = qR,, where g, = \,Yi\. Note that (|7]) 
and ([8ll are equivalent to Lij{k) =^{Pj[k),Aij,Cj,Rj) = ^{Pj{k),Aij,Cj,Rj) andf’i(k+1) = Y.je.J^^{Pji^)^'^ij,Cj,0,Rj) + 


Qi = 'LjeJ^,Qj3S{Pj{k),Aij,Cj,0,Rj)PQu respectively. 

Equation (|6]l is distributed, i.e., Lij ^ 0 only if Aij ^ 0. Therefore, the computation of Lij can be done distributedly 
and communication is required between local state estimators of dynamically interconnected subsystems only. 
Concerning the scalability of the algorithm observe also that, for / G {1,... ,M}, subsystem i permanently stores in 
memory only the matrices Qi, Ri, An, Ci and, for j G ,Ai, Aij, Cj and R,-; on the other hand, the information which 
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must be transmitted and temporarily stored at each time step consists of yj, xj, Pj, for all j G The DKF 
algorithm is formally described in Algorithm 1. 

Algorithm 1 DKF algorithm 
Memory requirements 

For ! S {1,... ,N}, subsystem i stores in memory 

- Permanently Qi, Ri, An, Q, {Aij,Cj,Rj;j G 

- Temporarily, for each k > 1, {yj{k),Xj{k),Pj{k),Lij{k)-,i G 


On-line implementation 

At each time step k>\ subsystem i: 

1) Measures yiik) 

2) Broadcasts to its successors the quantities yi{k), Xi{k), and Piik)-, 

3) Gathers from its neighbors the information \^yj{k),Xj{k),Pj{k)-,j G ^}; 

4) Computes the gains {Lij{k)-,j G c/lf} as in 0; 

5) Computes the estimate Xi{k+ 1) and the matrix Pi{k+ 1) as in (|6]l and ([8]l, respectively. 


C. Main properties 

Let us dehne x(A:) = {x\{k),... ,XM{k)), and the distributed hlter estimation error e(^) =x{k)—x{k). From (O 
and (|6]l we obtain that 

e{k + 1) = {A-L{k)C)e{k) -L{k)\{k) +w{k) (9) 

where L(A:) is the matrix whose block entries are Lij{k). Let Ild{k) =var(e(k)). From (|9ll the following is obtained. 

n^(k+l) = (A-L(k)C)nj(k)(A-L(k)C)^-L(k)RL(k)^+Q (10) 

The following result can be derived. 

Lemma 1: Assume that the pair (A,G) is stabilizable (where GG^ =Q) and that there exist symmetric matrices 
Pi>0, i = I,... ,M such that 

Pi> Y. i%{Pj,Aij,Cj,0,Rj) + Qi, (11) 

for all ! = 1,... ,M. For all i,j — 1,... ,M, let Ltj = ^{Pj,Aij,Cj,Rj) and let L be the matrix whose block entries 

are Ltj. Then, the matrix A —LC is Schur stable. □ 

Thanks to Lemma [1] a simplified version of the DKF Algorithm 1 can be devised: assuming that each subsystem 

i stores in memory matrix P,, i = with property (fTTl l. then it is sufficient to set F’,(k) = P, and Ljfik) = 

Lij = ^{Pj,Aij,Cj,Rj) for all k to guarantee that the estimation error e{k) is a stationary process. Therefore, the 
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error covariance of this modified scheme is asymptotically convergent to a bounded definite positive matrix, i.e., 
lim*.^,x,n^(A:) = tld for some positive definite matrix 11^. 

In case the Algorithm 1 is implemented, under the assumption that there exist steady-state solutions of ® Pi > 0, 
i = 1,... ,M, the next result can be proved. 

Proposition 1: Consider the DKF Algorithm 1. Assume that f^(l), i = 1,... ,M, are such that there exists P, with 
the property that 

limP,(;t) =.P;. (12) 

A:->oo 

Let P =diag(Pi,... ,Pm)- Then, there exists a positive definite matrix 11^/ such that lim^t^oo T^d{k) = and tlj < P. 
□ 

Note that, under the validity of (fT^ . then in steady state conditions also (fTTI) is verified. Therefore the DKF 
Algorithm 1 provides a stationary equation error; also. Proposition [T] states that, for i= matrix P, plays 

the role of an upper bound of the covariance of the prediction error Xi{k) —Xi{k) in steady state. 

Observe that Lemma[T]and Proposition [T] require the existence of matrices P,, i= 1,... ,M, such that either property 
or property (fTSl i are satisfied. However, differently from the centralized Kalman filter, these properties are not 
guaranteed by standard detectability assumptions on the system. 

In this paper we provide conditions under which these properties can be verified. In particular, in Section HII] we 
discuss the conditions allowing the application of centralized design procedures while, in Section IIVI we provide 
a distributed design procedure. 

We conclude this section with a couple of remarks. 

Remark 1: Consider the DKF algorithm and assume that (fTSl i holds true. Let be the steady-state covariance 
of the prediction error for the centralized Kalman filter. Obviously, if n£/(l) =11^(1) then nc(k) < Tlj{k) for all 
k>l. 

Remark 2: Assume that (fTlTi holds true and consider the simplified DKF algorithm described after Lemma [T] 
Then, also in this case, the asymptotic covariance of the prediction error Xi{k)—Xi(k) is upper-bounded by the 
matrix P,. 


III. Centralized design 

In this section we address the problem of providing (/) conditions that can be used to guarantee a-priori the 
validity of properties (fTTI) or (fTZl i and {ii) practical methods for computing them. First, in Section IIII-AI we will 
analyze (fTTI) through a linear matrix inequality approach; secondly, in Section IIII-BI we will provide an aggregate 
design procedure, based on small gain arguments, to guarantee (fTTli 

A. Design using LMFs 

In this section we provide a practical method based on LMFs for computing, if possible, matrices P, verifying 
(E]). Then, as already highlighted, if we set Pi(k) = Pt for all k and for all / = 1,... ,M, then it is guaranteed that 
this simplified version of the DKF algorithm has suitable convergence properties in view of Lemma [T] Also, its 
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suboptimality features are discussed in Remark |2] 

Using LMI’s we aim to compute (see Lemma [T]i P,, i = I,... ,M verifying 

Pi > L i^ijPAl -AijPjC] {CjPjCj + ft (13) 


Provided that Pj is non singular for each j G c/lf, the algebraic inequality (fOT l is equivalent to 


Pi > Lje^AijPjiPj+PjCjRj^CjPj)-^PjAjj + Qi 
> ZjeMijiPj' +CjP7'C,)-'A,?;. + e, 

thanks to the application of the matrix inversion lemma. Inequality (fl4ll can be cast as the following LMI 


(14) 


Pi A,iAi 
* A] 


Am Am G, 

0 0 


>0 


(15) 


* 0 ... Am 0 

* 0 ... 0 I 

where G, is defined in such a way that G,Gf = Qi and, for all j = I,... ,M, Aj > {P^^ If we define 

Plj=PJ^, the latter inequality can be written as 


I f2, + C7P7C, 


> 0 . 


(16) 


Finally, the equality fiy =Pj ^ can be managed using the recursive approach proposed in ||9l- Indeed, we solve the 
following LMI 




I 


>0 


(17) 


^ Pj 

and, at the same time, we minimize the additional cost function trln^P^}. The problem can be managed using the 
recursive cone complementarity linearization algorithm discussed in 0. 


B. Design using small gain arguments 

In this section we investigate conditions ensuring the validity of (fT^ . In particular, the following result addresses 
the offline design issue providing an aggregate and lightweight analytical condition, which relies on small-gain 
arguments. First, the following assumption is required. 

Assumption 1: For subsystem i, An is invertible. □ 

We will also need one of the following assumptions for properly initializing P,(l) for the implementation of 
Algorithm 1. 

Assumption 2: For subsystem i 

(i) {Aii,Ci) is detectable; 

(ii) {Aii,Gi) is stabilizable, where G,- verifies G,Gf = Qi', 
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Assumption 3: For subsystem i 

(i) {Aii,Ci) is detectable; 

(ii) {Aii,Gi) is stabilizable; 

Note that, while Assumption |2] is required to define, for a given subsystem i, Pj^ as the unique semi-positive definite 
solution to the local Riccati algebraic equation Pj^ = ^{Pj^,Aii,Ci,Qi,Ri), Assumption |3] allows to define Pf as the 
unique semi-positive definite solution to P^^ = ,Aii,Ci,Qi,Ri). 

Let us now define full rank n, arbitrary transformation matrices //,, i = i.e., //, G (introduced 

for reducing, if possible, the conservativity of the results stated next) and gains Z,, selected in such a way that 
Fj =Aii — LiCi is Schur stable. Define also F, = HiFiH^^ and A,y = HiAijHj^, for all j = I,... ,M. Finally we define 

_ f = 0 ^ if i = i 

\ Tij = if;Vi 

Scalars /r, > 1, A, G [0,1) are defined in such a way that Ill'll < /r,A/’. We introduce a further assumption. 

Assumption 4: For some values of Z,, Hi, (i) <j{Fi) < 1 for all i = 1,...,M, and (ii) <7(r) < 1. 

Note that, a necessary condition for the existence of matrix Z,- guaranteeing that <7(Z)) < 1 is that (A,i,C,) is 
detectable, i.e.. Assumption |3j therefore, the latter is implicitly required by Assumption |4] 

Theorem 1: If Assumption [T] holds for all / = and under Assumption IH there exist P,- > 0 for all 

1 = 1,... ,M such that, Pi{k) Pi as °° if one of the following initializations is used: 

a. P,(l) = 0 for all 1 = 1,...,M. 

b. P,(l) = pP if Assumption |2] holds for all i= 1,... ,M. 

c. P,(l) = P^ if Assumption [3] holds for all / = 1,... ,M. □ 

Regarding Assumption @1 provided that Assumption [3 (i) is verified, it is always possible of find Z; such that 
(y{Fi) < 1 for all / = 1,... ,M. Note that, in case the system has a cascade topology (i.e., if it admits a lower - or 

upper - block triangular form, 1331), F is block triangular, and therefore Assumption |4]can be easily verified. 

On the other hand, for more general system structures, we need to retrieve a suitable “decentralized” change of 
coordinates and, at the same time, a suitable “auxiliary” decentralized linear observer, for which (J(Z)) < 1 for all 
i = 1,...,M and the corresponding matrix F is stable. This amounts to a design problem, which can be cast, for 
example, as the following optimization. 

min C7(r) (18a) 

subject to the definition of F and to 

C7(r) < 1 (18b) 

C7(Z;) < 1,/= 1,...,M (18c) 

To reduce the computational load of (fTsT i the values of //, can be constrained. For example, one can try to minimize 

the terms /i, by constraining Hi to take values corresponding to which Z) = HiFiH^ * is diagonal (provided that Z, is 
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diagonalizable and has real eigenvalues), or one can set Hi = I„.. The optimization (fTSl l is nonlinear, and therefore 
a suitable initialization is fundamental, for example selecting Fi as the Kalman predictor gains. 

To reduce the computational complexity and to allow for flexible and reliable operation, in next Section we provide 
a distributed and scalable design procedure to be applied at each subsystem level. 

IV. Distributed design and plug and play features 
I n many practical applications, it is of interest to perform the design of the DKF in a distributed fashion, i.e., to 
have a set of conditions to be verified locally by each subsystem, possibly using pieces of information provided by 
the neighboring subsystems. 

Focusing on the main assumptions of Theorem [T] while Assumptions [T] and |4] (i) are local conditions, to be verified 
at a single subsystem level. Assumption |4] (ii) is centralized (although aggregate), since it involves information 
concerning the overall system. We now introduce the following assumption, providing a conservative, yet distributed 
and very simple, condition, that must be verified at a single subsystem level by each subsystem, that implies the 
Schur stability of T, as proved in Proposition |2] stated below. 

Assumption 5: For all / = 1,... ,M and for some values of Z,,, Hi, it holds that 

M 

p, = L < 1 (19) 

;=i 

Proposition 2: If Assumption |5] holds, then Assumption |4] (ii) is verified. □ 

As it will be shown in the remainder of the section, this result allows for PnP operation. The PnP scenario consists 
of the case when one or more subsystems (each described by ([T]i) or devices (and specifically a transducer) is added 
to or removed from the interconnected system. 

Before to proceed, the following standing assumption sets the scenario where PnP operations take place, assuming 
that the PnP event occurs at time instant k = Tp^p. 

Assumption 6: 

- For k < Tpnp, Assumptions |4] (i), |5] and [T] (for all i= 1,... ,M) hold. 

- At k = Tp„p the updates (O are in steady state, i.e., Pi{Tp„p) = P, for all ; = 1,... ,M. □ 

It is important to remark that, when PnP operations involving subsystems take place, the number of successors, 
for some subsystems, may change. Denote with the set of successors of subsystem i after the PnP event and 

In general it holds that ^ q. From this, it also follows that the matrices Aij, Q, and P, must be 

redefined, i.e., Af, = xf^Aa = \ l^Aa, = \ f^Ci = \ ^Ci, and Rf = C^Ri = ^Ri- Importantly, in case 

‘7 y "^7 y Sy ^ V ^ y ^ ' 71 

ii^ > 9, this may prevent the detectability of the pair to hold, which may jeopardize the verifiability of 

Assumption |4](i). We also assume that Hi and L,- are not redefined, for i= 1,... ,M after the PnP event. From this 
it follows that, for all ij = 1,...,M, ||Aj(At.)^^|| = ||A,jAtZ||. 

A. Plug-in of a subsystem 

Assume that, at step Tpnp, the subsystem (M+ 1) is introduced. For each i = 
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• if ^ ^ then ')^(m+i) = 0. Also, since = Q, then = A/. In view of 

this, p+ = If+' = I" 1 y,j = P, < 1; 

• if / S =5^+1 but i ^ ,/fM+i, then = q: Therefore = Fi, = p,-, and = A,-. However, since M+1 G 
YiiM+i) > 0. Therefore pA = 1"+/ Yij = Pi + Yi^m+i) > PP 

• if / G but i ^ ,5^+1, then Yi(M+\) = 0- However, gA = q ■ + 1 and therefore = y/gt + l {An — LiCi) = 

^yj^i’+ryigPi may not be Schur stable. If fX is stable, pA = p,- but, at the same time, XX = (g,- + 1 )/qA; > 

A,. In view of this yX = (^l — A?)/(l — AA^)^.^. for all j =1,... ,M. Therefore p+ = (1 — A,^)/(l —XX^)pi > pf, 

• i G Ho/Im+i, the Schur stability of FX is not guaranteed. If fX is Schur stable, we can compute pX = 

LjLiYijX Y^Im+i) = (1 - )/(l - V^)Pi + Yi(M+i) > Pi’ in view of the fact that both A+ > A; and Yt(M+\) > 

The design of L^+i, Hm+\ can be addressed through the following optimization problem. 

min P ++1 + ^ 7 +^ (20a) 

subject to; 


^(■^M+l) < 1’ Pm+1 1 

(20b) 

1-A? 

YJiM+i) <1 1 _ ;t+2-I ^ ^"+1 

(20c) 


When a plug-in request is received from subsystem M+ 1, the following design procedure must be adopted; (i) if 
(I 20 I 1 admits a solution and if, for all i G ^m+i, P,A < 1 and G{Fi) < 1, then allow the plug-in, otherwise deny it; 
(ii) properly initialize PM+i{Tpnp). 

The following corollary of Theorem [T] addresses the step (ii) and guarantees convergence of the system matrices 
Pi{k), k = 1,... ,M + 1 to steady state solutions. 

Corollary 1: If Assumption [T] holds also for i = M+ \ and if, after the plug-in event. Assumptions |4](i) and |5] are 
verified, then there exist P^ > 0 for all i= 1,... ,M such that, Pi{k) —pX as A: 00 if the following initialization 
is used; Pi{Tp„p) = Pi for all 1=1,... ,M and (a) Pm+\ (Tpnp) = 0, or (b) if Assumption |2] holds for i = M +1, 
Pm+i{Tp„p) = or (c) if Assumption [ 3 ] holds for / = M-f 1, Pm+i(?>„/>) = P^^^. □ 

Note that the initializations (b) and (c) limit possible undesirable transients on the state estimates. Note also 
that, at the (M-l-l)-th subsystem level, to solve (l20l l. the required data consist in (i) the local system matrices 
(^(m+ 1 )(m+i)iCm+i), (ii) the number g^+i of successors of subsystem M-f 1, (iii) Ajj, Hj for all j G ^m+i, 

(iv){l-Xf)/il-Xf)Pj, ^j{M+i)’ XIj for all j G X^M+i- It is therefore clear that this local design problem requires 
the transmission of a limited amount of information, i.e., through a neighbor-to-neighbor communication graph. 
Also, remark that the optimization problem ( |20] | is a nonlinear one; to simplify it, an efficient strategy amounts, 
for example, to define Hm+i as the matrix such that Fm+\ is diagonal (i.e., in case Fm+\ is diagonalizable and has 
real eigenvalues), making Hm+i depend upon Lm+i, or simply setting Hm+i = Ini- 1*^ *^bis way we can reduce the 
number of free variables of the problem. 
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B. Unplug of a subsystem 

Assume that, without loss of generality, at step 7>„/>, subsystem M is unplugged. Note that, 

. if lY then p+ = 7ij = lf=i 7j = Pi < 1; 

. if 1 e but i ^ ^M, then p + = Jij = Pi-J m 

• if! S but i ^ T/'m, then = g,- — 1 and therefore Fj^ = y^(g,-— \ )/qiFi. From this it follows that = p; but, 

at the same time, Q-A; < A,. Also Y^j = {^— Xf)/{\ — X^'^)Yij for all j ,M. Therefore 

P,^ = (i-^?)/(i-V^)p;<PP 

• i G it follows that pA = TjA = (1 — A?)/(l —AX^)(pi — YiM) < Pi, in view of the fact that both 

AA < A; and y^ = 0- 

In view of this, since Assumptions |5] and |4](i) hold before the unplug event, then they are guaranteed for the system 
deprived of the M-th subsystem. Therefore, any unplug request can be accepted, without hampering the convergence 
properties of the estimator. 

The following corollary of Theorem [T] guarantees convergence of the system matrices Pi{k), k= to new 

steady state solutions. 

Corollary 2: After the un-plug event, there exist P^> 0 for all ! = 1,... ,M such that, Pfk) —PX as A: —oo if 
the following initialization is used; Pi{Tp„p) = Pi for all i = 1,... ,M — 1. 


C. Plug and play of transducers 


In many practical applications, the sensors embedded in a subsystem can be added, removed, or replaced. We 
consider that changes occur to the M-th subsystem for simplicity, but without loss of generality. Practically, this case 
consists in a change in the matrix Cm (and, consequently. Cm), while the topology of the system and its dynamics 
remain unchanged. Therefore, for all i M, p+ = p,, since p,, i ^ M, do not depend on Cm, but only on matrices 
Aij and on the number of successors, which remain unchanged. 

On the other hand, focusing on subsystem M 

• if a transducer is plugged in, this consists of adding a row (here denoted Cadd) to matrix Cm, i-O-, 


C+ — 


Cm 

^add 


This means that the detectability properties of the pairs [Au^Ci) and (Au^Ci) are not jeopardized by the plug-in 


Lm 0 


, then p^ = Pm- This means that the 


event. Also, if Hm remains unchanged and if we take Lfj = 
addition of a new transducer does not compromize the convergence properties of the DKF scheme, 
if a sensor is replaced or unplugged, this consists of a substantial variation of the matrix Cm- This means that, 
before to allow the PnP operation, one must verify the existence of a gain Lm such that the following are 
verified; (I) Schur stability of Ff, (II) p^ < 1. Concerning the latter, note that Hm should remain unchanged, 
in order not to affect the values of Pi, i M. 
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In the case considered in this section, however, it is not clear how the PnP operation impacts on the values of 
the matrices Pi{k), i = 1,...,M. In order to guarantee the convergence of the matrices Pi{k) to a steady state, the 
following practical procedure can be adopted, suggested by Corollaries [T] and |2] (a) for k > Tpnp, make Pi{k), 
i = — 1 evolve as if subsystem M were unplugged; (b) after convergence is achieved (say, at instant 

Tconv) make Pi{k), i = 1,...,M, k > Team evolve as if subsystem M were plugged-in at time Tconv, i-e., by setting 
Pi{Tconv + 1) = Pi{Tconv), i = 1, • • • ,M and PM{Tconv + 1) = Pm' 

V. Simulation results 

In this section we provide some simulation results illustrating the application of DKF to two different examples, 
an academic one and the Hycon2 benchmark described in lIZTll . respectively. 


A. Academic example 

In this section we consider a system composed of interconnected subsystems. We set 



0.9 0.1 


_ ^ 

Aii = 


and C, = 

1 1 


0.1 -0.9 




and, for all j G Ajj =diag(a,—a), where a > 0. Also E [w,wf] = Qi = h and E [v,vf] =/?; = ! where h is 

the 2-dimensional identity matrix. 

1) Dependence on coupling and centralized design: 

First consider M — 2, with ,yV\ = JY 2 = {1,2}. In Figure [1] we show the relationship between the coupling strength 
a G [0,6.5] and (/) the spectral radius <7(r) of matrix F; (n) the spectral radius of A —LC obtained through the 
LMI-based design procedure sketched in Section IIII-AI The latter procedure has given numerically reliable results 
for a < 6.5. 

To realize the upper plot, two different choices of //, are adopted; (I) such that Fm+x is diagonal; (II) Fit = I 2 for 
all i = 1,2. As it is apparent, the spectral radius of F does not signihcantly vary in the latter cases. 

In both cases it is apparent that the small-gain procedure sketched in Section IIII-BI is applicable when the coupling 
strength is sufficiently small. Also, from the lower panel it is apparent that <7(r) < 1 is just sufficient to guarantee 
that (7(A-LC) < 1. 


2) Plug and play scenario: 

Now, assume that M — 3 and a = 0.1. At time f = 0, JF\ = jFi = {1)2} and = {3}, i.e., subsystem 3 is not 
connected with the network. In this case 


r = 


0 

0.1334 

0 


0.1334 

0 

0 


0 

0 

0 


Therefore, we have that p, < 1 for i— 1,2,3. 

At f = 100 subsystem 3 plugs in and connects with subsystem 2. More specifically jVx = {1,2}, JV 2 = {1,2,3}, 
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0.6 
0.5 
y 0.4 

io.3 

0.2 

0.1 


2 3 4 5 6 7 


Fig. 1. Upper panel: values of <7(r) as a function of a (setting Hi such that F) is diagonal - dots; setting Hi —h- circles); lower panel: values 
of (7(A —LC) as a function of a. 


and ^ = {2,3}. In this case 


r = 


0 

0.1535 

0 


0.1334 0 

0 0.1535 

0.0976 0 


The plug-in request is accepted, since p; < 1 for i— 1,2,3. 

Finally, at f = 200, subsystem 1 unplugs, meaning that = {1} JV 2 = = {2,3}. As discussed in Sec¬ 

tion IIV-BI the unplug request is automatically accepted, as it is witnessed by the values taken by the entries on F 
in this case; 

0 0 0 

r= 0 0 0.1334 

0 0.0976 0 


In Figure |2] the state trajectories are depicted, showing the different collective dynamical behaviours taken in 
correspondence with the different graph configurations. 

In Figure [3] the trajectories of the root mean estimation errors rmse,- = \/^lnsYl'tL\ for subsys¬ 

tems’ states are depicted, showing that, in view of the proper matrix initializations, when plug and play operations 
occur the estimation error does not suffer from undesirable transients. 


B. Power network benchmark 

In this section we consider a power network system including a number of power generation areas coupled 
through tie-lines. This system has been adopted also in ll29l where the authors proposed a partition-based distributed 
estimation scheme tailored to power networks applications and exhibiting promising numerical results (although 
without any theoretical guarantees). 
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Fig. 2. State trajectories. Xi^i( denotes the ^-th entry of Xi. 


Subsystem 1 



50 100 150 200 250 300 


Fig. 3. Root mean estimation error for each subsystem, obtained with DKF - solid line - and centralized KF - dotted line (the lines are 
practically overlapping). 


Our contributions are two-fold; firstly, in Section IV-BII we compare DKF with the centralized Kalman filter and 
the distributed strategy proposed in ll29l : secondly, in Section rV-B2l we test the PnP features of DKF in case a new 
subsystem is plugged in the network during its operation. 

The dynamics of each power generation area, equipped with primary control and linearized around the equilibrium 
value for all variables, is described by the following continuous time LTI model lIZTll 

Xiit) =A‘iiXiit)+B'iUi + L';APL, + ^ A‘ijXj ( 21 ) 


July 27, 2015 


DRAFT 
































15 


where x, = (A0,, A®,, AP^, , Afy,) is the state, m, = APyg/. is the control input of each area, and APl, is the local power 
load. Note that the letter A is used to denote the deviation from steady-state. The matrices of system (|2TI) are 


0 

1 

0 

0 


0 


Di 

1 

0 


0 

2Hi 

2Hi 

2Hi 

, B’i = 

0 

0 

1 

4 

1 

4 


0 

0 


1 

0 

1 


1 



RiTg. 

4,- J 


L 4, J 


0 

0 

0 

0 


0 


^ij - 

Pu 

2Hi 

0 

0 

0 

5 ^ 


1 

2Hi 


0 

0 

0 

0 


0 



0 

0 

0 

0 


0 



where the parameters and their numerical values are defined in ll27l . Since both APre/. and AP/,, are assumed to be 
constant and known, for the sake of simplicity, we neglect them from our analysis. 

We discretize the process (l2Tli with a sampling interval T according to the technique proposed in ITO). leading to 
the discrete-time model ([T]i where the matrices An, Aij can be easily constructed from (l2lT i. The matrix C, is 


Ci = 


10 0 0 
0 10 0 


For i £ {1,... ,M}, E [w,wf ] = Qi = 2> I 4 and E [v,vf ] = P, = h where 4 is the k-dimensional identity matrix. 

1) Comparison test: 

In this section we consider the scenario 1 in IET), where M = 4 and where the adjacency matrix Ad, defining the 
neighboring relationships between areas, is 


Ad = 


0 10 0 
10 10 
0 10 1 
0 0 10 


namely, Adij ^ 0 if and only if ^ ^ 0. In Figure |4] we depict A0i and its estimate A0i generated by the DKF 
algorithm. 

In Figures |5] and |6] we compare the performance of DKF algorithm with that of the centralized Kalman predictor 
and of the distributed strategy proposed in ll29l . In Figure |3 we plot the normalized estimation error e{t) defined as 


e{t) = ^||x(f)-x(f)|| 

for the first 100 iterations. In Figure [6] we plot e{t) from f = 30 up to ? = 100 (i.e., in stationary conditions). In 
the Table I^BT] we report the average value of the estimation error evaluated between iteration 30 and 100. 



Centralized 

DKF 

Strategy in [29] 

Error Mean 

13.74 

14.08 

17.21 
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Fig. 4. Trajectory of A0i (blue line) and its estimate (red line), obtained with DKF 



Fig. 5. Trajectory of e(t) obtained with DKF (red line), with a centralized Kalman predictor (blue line), and with the method proposed in 
(black line), with t € [0,100]. 


Notice that the performance of DKF algorithm is quite close to the performance of the centralized Kalman filter 
and that it outperforms the performance of the strategy in ll29l . Additionally Assumptionis satisfied with //,■ =h. 
2) Plug and play scenario: 

In this section we consider a PnP scenario. Specifically we assume that at time step 50 a new area (i.e., area 5) is 
added to the power network, and that, in particular, it gets connected to area 2. Again the values of the parameters 



Fig. 6. Trajectory of e[t) obtained with DKF (red line), with a centralized Kalman predictor (blue line), and with the method proposed in ED 
(black line), with t G [20,100]. 
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Fig. 7. Trajectory of e(t) obtained with DKF (red line) and with a centralized Kalman predictor (blue line) in the PnP scenario. 


defining area 5 can be found in lIZTll . The adjacency matrix describing the interconnections after step 50 is 


Ad = 


0 10 0 0 
10 10 1 
0 10 10 
0 0 10 0 
0 10 0 0 


In Figure IV-B2I we depict the behavior of the estimation error for both the centralized Kalman filtering algorithm 
and DKF. Observe that, also in this plug and play scenario the performance of the DKF algorthm is comparable 
with that of the centralized Kalman filter. As expected, when a new area is added to the network the value of e{t) 
increases mainly due to the poor estimation quality concerning the state of the area 5. However, after few iterations 
the value of e{t) settles around a value which is comparable to its value before the addition of the new area. 


VI. Conclusions 

In this paper a novel partition-based distributed observer based Kalman filter, named DKF, is proposed. The main 
advantages of the discussed state estimator are: (i) scalability, in terms of both computational and communication 
loads required for the online operations; (ii) the convergence properties can be proved under mild conditions; (Hi) 
distributed and plug and play design are allowed. In fact, not only centralized (although aggregate) but also distributed 
conditions for estimation convergence are given, which confer reconfigurability to the proposed estimation scheme. 
Simulation tests are provided to illustrate the effectiveness of DKF. For example, we have considered a well-known 
benchmark example, proposed in the framework of the Hycon2 Project. Future work include the application of 
DKF to a real test case, e.g. smart grids. 


Appendix 

The following preliminary result is needed for the proofs of both Lemma [T] and Proposition [T] 

Lemma 2: Define P(k) = diag(Pi(k),...,P m(^))- If Pi{k) i = 1,...,M are updated according to ®, then 

P(kH-l) >.^(P(k),A,C,Q,R). (22) 
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Proof of Lemma |2] 

Since P(A:) is block-diagonal define 

V^{k) = P(k)-P(k)C^(CP(k)C^-bR)-'CP(k) 

= diag(Pf(k),...,P£(k)) 

where 

p[{k) = Pfk) - Pi{k)Cj{CiP,{k)Cj +Ri)-^CiP,{k) > 0 (23) 

for all i= 1,... ,M. Furthermore, it holds that 

AP^ (k) -f Q < diag(Pi (k + 1),..., Pm (^ + 1)) (24) 


where 


M 


Pi{k+\)=Y^qjAijP’;{k)Al + Qt 

f=i 


(25) 


which is equivalent to ®. Inequality (l24l i can be proved as follows. Define a vector v = (vi,..., vm), where v, S R”' 
for all i= 1,... ,M. We compute that v^AP^(k)A^v = 

yM aT 

Li=iAiiV, 


&yjAn ... 


M ,,r. 


L;=l j 


P^(k) 


y^ Ai y. 


(26) 


where Wij = {^)AfjVi- Remark that w,y = 0 identically iff Ay = 0, and that the number of nonzero vectors 
w.j is equal to gj. We compute that Y4L1 ^ij = llr,sey- Note that, since ||ws; — Wrj||^ > 0, wJjWsj < 

1 + wPwsj). Therefore y^rj^sj < 5 Lr,seJ^j {\\»^rjf+ \\wsj\\^) = Gj Liey, 11 IP = Lie^j 11 11 l^A..pF • 

From this, it follows that 


If=l (lEl ^ij) < l|v/| 


ijAijpfm, 


= 1 ? 


M 


= 1 II ^nlv^^ 


M . vF(iAaT 


= diagd" 1 qAi^P/ (k)A{^.,... ^ g^AujP^ {k)Ayy 

from which (l24l i readily follows. 

Proof of Lemma Q] 

From Lemma 12] one has that 


□ 


P > (A - LC)P(A - LC)^ + Q + LRL^ 


(27) 


where the block entries of L are Ly = ^{Pj,Aij,Cj,Rj), which is equivalent to L = ,5f(P,A,C,R). The latter 
follows from the fact that L = Adiag(Lj^,... ,L^), where diag(L(^,... ,Lm) ~ PC^(CPC^ +R) \ which is block- 
diagonal in view of the block-diagonality of C, P, and R. Assume, by contradiction, that (A — LC) is not Schur 
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Stable. Therefore, there is at least an eigenvalue/eigenvector pair X,v of (A —LC) such that (A —LC)^v = Av and 
|A| > 1. From dZTl) 

v^Pv > (A - LC)P(A - LC)^V + v^Qv + v^LRL^v 

from which it follows that (1 — |Ap)v^Pv > v^Qv +v^LRL^v. Since the right hand side of the latter inequality is 
> 0 and |A| > 1, the only possibility is that |A| = 1, v^Qv = 0, and L^v = 0. In view of this, A^v = v and G^v 
should hold at the same time which, recalling the PBH test, is in contradiction with the assumption that the pair 
(A,G) is stabilizable. This concludes the proof of Lemma [1] □ 

Proof of Proposition Q] 

As a preliminary step, we show that, if n;/(l) < P(l), then Tld{k) < P(A) for all k >0. This can be proved 
using induction arguments. Assume that, at instant k, it holds that Ild{k) < P(A). Recalling Lemmawe have that 
P(A+1) >,!^(P(A),A,C,Q,R), where .!^(P(A),A,C,Q,R) = (A-L(A)C)P(A)(A-L(A)C)^+L(A)RL(A)^+Q. 
From this and (fTOl i it results that 

P(A+l)-nrf(A+l) > (A-L(A)C)(P(A)-n,,(A))(A-L(A)C)^ >0 

Therefore, Yld{k+ 1) < P(A+ 1). By applying induction arguments, we can prove that n^(A) < P(A) for all k>Q. 
If P(A:) —P as A —oo, then L(A) L such that, in view of Lemma[T] A —LC is Schur stable. Consider the evolution 
of matrix Ild{k). From the stability of A —LC, then Ild{k) lid, for all initial conditions n(/(l), where lid is the 
unique solution to the Lyapunov equation fid = (A —LC)n^(A —LC)^+Q + LRL^. If we set n^(l) =0, from the 
preliminary result then Ild{k) < P(A) for all A > 0 and fid < P. Noting that fid is the unique steady-state attained 
for all possible initial conditions the proof is concluded. □ 

The proof of Theorem [T] heavily relies on classical results on Kalman filters, e.g., a, a, a, a. Similarly to 
well known results on the discrete-time Riccati equation, we need two intermediate results. 

Lemma 3: If Pf(k) > Pf(k) for all j =1,... ,M, then Pf{k+ 1) > PfikP 1) where Pf‘{k+ 1) and Pf{k+ 1) are 
the matrix evolutions, obtained with ®, starting from Pf{k) and Pf{k), respectively. 

Proof: Note that we can write ® as 

m+l) = l^%{Aij-Ltj{k)Cj)Pj{k){Aij-Li,{k)CjY 
+L,' j {k)R jLij (k) + Qi 

where, according to the classical Kalman filter theory, Lij(k) = Jf{Pj{k),Aij,Cj,Rj) minimizes the term {Aij — 
Lij{k)Cj)Pj{k){Aij — Lij{k)CjY + Lij{k)RjLij{kY for all i,i = 1,...,M. Therefore, consider the matrices pA, Pf, 
where Pf > P^ for all / = 1,... ,M, and optimal the gains Lfj and corresponding to Pf and P^ respectively, 
then for all j, {Atj - lYjCj)Pf{Aij - lYfjY + lYjRjilYjV < (Au - i4jCj)Pf{Aij - l4jCjY + i4jRjY4jV < i^ij- 
LfjCj)P^{Aij — ifjCjY + LfjRj{LfjY, and the proof is concluded. □ ■ 

Lemma 4: If Assumptions [T] (for all i = 1,..., M) and |4] hold then, for all P (1) >0 i= 1,..., M, there exist p^^^ 
for all i= 1,... ,M such that Pfk) < pM^^ for all A > 0. 
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Proof: An alternative formulation of ([Sj is 

P,{k)=lfik)+A,ik) (29) 

where Pf^{k + 1 ) = AuPf {k)Al + Qi = {An - Lu{k)Ci)P,{k){Au - Ln{k)Cif + Ln{k)RiLl {k) + Qi and A,(^ + 1) = 
Y.j^iAijPj{k)Afj, being P^(k) defined in (|2^ . In view of Assumption [T] we can write Pf (^—1) =Aj-^{Pf'{k) — 
Qi){Aj^Y- Therefore 

A,{k) = - Qj) (^ 7 / (30) 

Since Fi = {An — LtCt) is Schur stable (thanks to Assumption |4] (i)) and L, is a suboptimal gain 

PL{k+l)<Fi{Pt{k)+Ai{k))Ff + Qi+LiRi{LiY 


Solving the latter we obtain; 


Pt{k)<Fl‘-'pt{l){F,Y‘‘-' 


(31) 


4-1 


+ {FMk-h)Ff+ Qi+liRi{LtY) {Ff) 




h=l 


Using the transformation matrices Fit, we define = HiPYk)H[, Qt = FliQiHf Qt = Hi{Qi +LjRi{LiY)F[[. 

Note also that HiAijAj^^Hy^ = HiAjjHj^^HjAjjHj^^ = AjjAjjP In view of this and (l30l l. we can rewrite OTT i as 
follows 


h=\ 


+ L L ip^k -h)- Qp 

j^ih=l 


(32) 


Recalling that /’j'(k) > P^k) — QjlAO for all 7 = 1,... ,M, we have that 


II^^WII<l|7;-^-7Pll^7(i)ll + Ell^/lPy^ 

/i=i 


4-1 


Mi 


(33) 


Therefore 


4=1 


\\pt{k)\\<^fxY\\pKi)\\ + 


pt 


i-A, 


iiiai 


-LiiA,,4T(^x||^(fi)ii)^ 


(34) 


Denoting nj{k) = max/,g[o jt] ||4(^)ll’ *(341) implies that 0 < ni{k) < qi + Zj^iYunjik) where q, = nf\\PYl)\\ + 
2 2 

j^WQiW and Yij = Apll^0^7/lP- Finally denote the vectors n(k) = (ni(k),... and q= {qi,...,qM). 

We obtain that 


{Im - r)n(k) < q 


(35) 
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According to Q, if the spectral radius of F is strictly smaller than one, for every initial condition (see, e.g.. Lemma 
13 for the general nonlinear case), the solution to the system (l29l l exists and is uniformly bounded, since q does 
not depend on k. □ 


Now we are in the position to provide the proof of Theorem [T] 

Proof of Theorem |7] 

First consider all the initializations a, b, and c. 

a. In case P,(l) = 0 for all / = 1,... ,M then P,(2) > 0 = ^((1) for all i = 1,... ,M. 

b. Set P,(l) = Pf for all i= Note that matrices Pj^ exist and are unique for all / = 1,... ,M in view of 

Assumption |2] From (O, for all i= 

Pi{2) = gifg{P^,An,Ci,Qi,Ri)+^je^{t} gjfg{P^,Aij,Cj,0,Rj) 

> fg{Pf,Aii,Ci,Qi,Ri) =P^= Pfl) 

c. Set P,(l) = Pj^ for all / = 1,... ,M. Note that matrices Pj^ exist and are unique (for all / = 1,... ,M) in view 

of Assumption [3 Set Pi{l) = P^ for all i= Then, from ((3), for all i = 

Pi{2) = fg{P^,Aii,Q,Qi,Ri)+Lje^{i}^{P^AijA,0,Rj) 

> ff{p^,AiiA,QuRi) =A = PA) 

In all cases, applying induction arguments and in view of the monotonicity property (i.e.. Lemma 13, Pi{k+ 1) > 
Pi{k) for all k > 1 and for all i = 1,...,M. Therefore the sequence of matrices P(^) =disig[Pi{k),... ,PM{k)) is 
monotonically increasing, in the sense that P(A:+1) > P{k) for all k. In view of the boundedness property (i.e.. 
Lemma II, there exist P,- for all i, such that Pfk) —^ P, as A: —^ oo. □ 

Proof of Proposition \2\ 

The proof easily follows from the Gershgorin circle theorem. Indeed, each eigenvalue of F lies in at least one of 
the M Gershgorin circles, i.e., since ja = 0 for all i, the values of A satisfying |A| < pi = 

each i= Then, if p, < 1 for all i= all eigenvalues verify |A| < 1. □ 

Proof of Corollary Q] 

When plug-in events take place, if j <M is a neighbor (also said predecessor in ll26ll l of M+ 1, then gj = gj + l, 
otherwise gf = gj. In view of this, AA = ^(for / g CA = ^and pA = ^^Rj for all j G cAm+i; 
otherwise AA =Aij, Cj = Cj, and pA = Rj, Therefore, for all initializations and for all i = 1,... ,M 

PiiTp„p+l) = ^j^^+tM{Pj{Tp„p),Atj,CtARj)+Qi 

= Lje^. ^flM{Pj{Tp„p),AijAjARj) + Qi 

+^{Pm+i iPpnp)A^(^M+\)AM+i^^Ati+\) 

— + Qi — Pi — Pi{T^Pnp) 
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a. if Pm+\(T pnp) = 0, Pm+i {TpnP + 1) > 0. 

b. \i PM+\{Tpnp) ~^M+V PM+l{TpnP+^) ^ Cm+\tQm+\^Rm+i) — Pm+\ = PM+\{Tpnp), 

C. \f PM+\{Tpnp) — Pm+V ^M+l(7f>„/>+ 1) ^ ^{PM+\^'^^M+i){M+l)' ^M+1 ’ 2^+1’^M+1) “ ^M+1 — PM+\{Tpnp) 

In all cases it follows that, for all 1=1,... ,M+ 1, Pi{Tp„p+ 1) > Pi{Tpnp). Applying an induction argument and in 
view of the monotonicity property, Ptik+l) > Piik) for all k > Tp„p and for all i= Therefore the sequence 

of matrices P(A:) =diag{Pi(k),... ,PM{k)) is monotonically increasing, in the sense that P(A:+1) > P(A:) for all 
^ > TpnP- Since Assumption |4] (ii) holds in view of Assumption |5] and Proposition |2l the boundedness property 
holds in view of Lemma|4] and therefore there exist P^ for all i= 1,... ,M+ 1, such that Pi{k) PP as k^ oo.D 


Proof of Corollary |2] 

When the unplug event takes place, if j < M is a neighbor of M, then = gj—l, otherwise gf = gj. In view of 
this, A^j = if i G SCj, and and = ^‘^Rj for all j G ^m', otherwise Aj = Cj, 

and R^ = Rj. Therefore, for all 1 = 1,... ,M — 1 


P.iTpnP +1) = fg{Pj{Tp„p),A+,C+,Q,Rj) + Q. 

= ^ffgiPjiTpnp),Aij,Cj,0,Rj) + Q: 

< Lje^f^{PjiPpnp)Aj:Cj,0,Rj)+Qi 


= 'Lje^i^{Pj{Tpnp)Aij,Cj,0,Rj)+Qi 

—^{PM{Tpnp),AiM,CM,0,RM) < Pi = Pi{Tpnp) 

Then, applying an induction argument and in view of the monotonicity property, P,(A: +1) < Pfk) for all k > Tp„p and 
for all 1 = 1,... ,M. Therefore the sequence of matrices P(k) =diag(Pi(A:),... ,PMik)) is monotonically decreasing, in 
the sense that P(k +1) < P(k) for all k > Tp„p. In view of the fact that P(k) > 0, there exist pA for all / = 1,... ,M +1, 
such that Pi{k) —>■ P^ as k ^ oo, □ 
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